Prediction of protein-ligand interactions from paired protein sequence motifs and ligand substructures.
نویسندگان
چکیده
Identification of small molecule ligands that bind to proteins is a critical step in drug discovery. Computational methods have been developed to accelerate the prediction of protein-ligand binding, but often depend on 3D protein structures. As only a limited number of protein 3D structures have been resolved, the ability to predict protein-ligand interactions without relying on a 3D representation would be highly valuable. We use an interpretable confidence-rated boosting algorithm to predict protein-ligand interactions with high accuracy from ligand chemical substructures and protein 1D sequence motifs, without relying on 3D protein structures. We compare several protein motif definitions, assess generalization of our model's predictions to unseen proteins and ligands, demonstrate recovery of well established interactions and identify globally predictive protein-ligand motif pairs. By bridging biological and chemical perspectives, we demonstrate that it is possible to predict protein-ligand interactions using only motif-based features and that interpretation of these features can reveal new insights into the molecular mechanics underlying each interaction. Our work also lays a foundation to explore more predictive feature sets and sophisticated machine learning approaches as well as other applications, such as predicting unintended interactions or the effects of mutations.
منابع مشابه
P-31: The Alteration of SpermatogenesisHas A Correlation with Sertoli Cell Mitochondrial Abnormal Morphology in Cytotoxicity of Testicular Tissue Mediatedwith Monosodium
Background: Male infertility has many causes, including genetic infertility. The NOP2/Sun domain family, member7 (Nsun7) gene, which encodes putative methyltransferase Nsun7, has a role in sperm motility. The aim of the present study was to investigate the effect of the T26248G polymorphism on Nsun7 protein function and its role in male infertility. Materials and Methods: Semen samples were col...
متن کاملP-30: The Effect of The T26248G Polymorphism on Putative MethyltransferaseNsun7 Protein Function and Its Role in Male Infertility
Background: Male infertility has many causes, including genetic infertility. The NOP2/Sun domain family, member7 (Nsun7) gene, which encodes putative methyltransferase Nsun7, has a role in sperm motility. The aim of the present study was to investigate the effect of the T26248G polymorphism on Nsun7 protein function and its role in male infertility. Materials and Methods: Semen samples were col...
متن کاملBiological Applications of Isothermal Titration Calorimetry
Most of the biological phenomena are influenced by intermolecular recognition and interaction. Thus, understanding the thermodynamics of biomacromolecule ligand interaction is a very interesting area in biochemistry and biotechnology. One of the most powerful techniques to obtain precise information about the energetics of (bio) molecules binding to other biological macromolecules is isoth...
متن کاملConserved Core Substructures in the Overlay of Protein-Ligand Complexes
The method of conserved core substructure matching (CSM) for the overlay of protein-ligand complexes is described. The method relies upon distance geometry to align structurally similar substructures without regard to sequence similarity onto substructures from a reference protein empirically selected to include key determinants of binding site location and geometry. The error in ligand positio...
متن کاملBiochemical Aspects of Protein Changes in Seed Physiology and Germination
Seed storage proteins are synthesized as sources of carbon, nitrogen and sulfur for the next generation of plants. Reactive oxygen species serve as second messengers for signal transduction; however, molecular targets of oxidant signaling have not been defined. Here, many researchers showes that ligand–receptor mediated signaling promotes reactive oxygen species– dependent protein carbonylation...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing
دوره 23 شماره
صفحات -
تاریخ انتشار 2018